Multi-UAV Tactic Switching via Model Predictive Control and Fuzzy Q-Learning
نویسندگان
چکیده
Corresponding Author: Hafez, Ahmed, Taimour, PhD Head of Aircraft Armament Department, Tel : +201005115209 Email: [email protected] Abstract Teams of cooperative Unmanned Aerial Vehicles (UAVs) require intelligent and flexible control strategies to allow the accomplishment of a multitude of challenging group tasks. In this paper, we introduce a solution for the problem of tactic switching between the formation flight tactic and the dynamic encirclement tactic for a team of N cooperative UAVs using a predictive decentralize control approach. Decentralized Model Predictive Control (MPC) is used to generate tactics for a team of N UAVs in simulation and real-world validation. A high-level Linear Model Predictive Control (LMPC) policy is used to control the UAV team during the execution of a desired formation, while a combination of decentralized LMPC and Feedback Linearization (FL) is applied to the UAV team to accomplish dynamic encirclement. The decision of switching from one tactic to the other is derived by a fuzzy logic controller, which, in its turn, is derived by a Reinforcement Learning (RL) approach. The main contributions of this paper are: (i) solution of the problem of tactic switching for a team of cooperative UAVs using LMPC and a fuzzy controller derived via RL; (ii) simulations demonstrating the efficiency of the method; and (iii) implementation of the solution to on-board real-time controllers on QballX4 quadrotors.
منابع مشابه
UAVs in Formation and Dynamic Encirclement Via Model Predictive Control
Switching between the formation flight tactic and the dynamic encirclement tactic for a team of Unmanned Aerial Vehicles (UAVs) is done using a decentralized approach. A team formed from N UAVs, accomplishes a line-of-breast formation then dynamic encirclement around a desired target. A high-level Linear Model Predictive Control (LMPC) policy is used to control the UAV team during the execution...
متن کاملModel Predictive Controller Design for a Novel Moving Mass Controlled Bi-rotor UAV
This paper presents design and implementation of Model Based Predictive Controller (MPC) for a novel Bi-Rotor Moving Mass Controlled (MMC) Unmanned Aerial Vehicle (UAV). Due to the strict constrained control inputs in this type of UAV, it is necessary to take into account the constrained controller design and un-constrained control methods are not applicable. MPC controller which is designed ba...
متن کاملReinforcement learning based feedback control of tumor growth by limiting maximum chemo-drug dose using fuzzy logic
In this paper, a model-free reinforcement learning-based controller is designed to extract a treatment protocol because the design of a model-based controller is complex due to the highly nonlinear dynamics of cancer. The Q-learning algorithm is used to develop an optimal controller for cancer chemotherapy drug dosing. In the Q-learning algorithm, each entry of the Q-table is updated using data...
متن کاملAn Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources
This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage is employed a PID controller which its parameters are designed using sine cosine optimization (SCO...
متن کاملMini/Micro-Grid Adaptive Voltage and Frequency Stability Enhancement Using Q-learning Mechanism
This paper develops an adaptive control method for controlling frequency and voltage of an islanded mini/micro grid (M/µG) using reinforcement learning method. Reinforcement learning (RL) is one of the branches of the machine learning, which is the main solution method of Markov decision process (MDPs). Among the several solution methods of RL, the Q-learning method is used for solving RL in th...
متن کامل